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Abstract 

The iterative decoding threshold of low-density parity-check (LDPC) codes over the binary erasure 
channel (BEC) fulfills an upper bound depending only on the variable and check nodes with minimum 
distance 2. This bound is a consequence of the stability condition, and is here referred to as stability 
bound. In this paper, a stability bound over the BEC is developed for doubly-generalized LDPC codes, 
where the variable and the check nodes can be generic linear block codes, assuming maximum a 
posteriori erasure correction at each node. It is proved that in this generalized context as well the bound 
depends only on the variable and check component codes with minimum distance 2. A condition is also 
developed, namely the derivative matching condition, under which the bound is achieved with equality. 

I. Introduction 

LDPC codes [1] have been intensively studied in the last decade due to their capability to 
approach the Shannon limit under iterative belief-propagation decoding. An LDPC code of length 
N and dimension K can be graphically represented as a bipartite graph, known as Tanner graph, 
with N variable nodes (VNs) and M > N — K check nodes (CNs) [2]. In the Tanner graph, the 
degree of either a VN or a CN is defined as the number of edges connected to it. A degree-n 
VN of an LDPC code can be interpreted as a length-^ repetition code, i.e., as a (n, 1) linear 
block code repeating n times its only information bit towards the check node decoder (CND). 
Instead, a degree-n CN of an LDPC code can be interpreted as a length-n single parity-check 
(SPC) code, i.e., as a (n,n— 1) linear block code. 

An extension of the concept of LDPC code is represented by doubly-generalized LDPC (D- 
GLDPC) codes [3], where the VNs and the CNs are allowed to be generic (n, k) linear block 
codes instead of repetition and SPC codes, respectively. If only the CND is generalized while 
all the VNs are repetition codes, then the code is said a generalized LDPC (GLDPC) code, or 
a Tanner code [2]. 

In a D-GLDPC code the codes used as VNs and CNs are called component codes. In this work 
each component code is supposed to be a linear block code having a minimum distance d m \ n > 2. 
The VNs and the CNs which are not repetition or SPC codes, are referred to as generalized 
nodes. The corresponding code structure is depicted in Fig. [Q An (n, k) generalized VN is 
characterized by n connections towards the CND; moreover, k of the N D-GLDPC encoded 
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Fig. 1. Structure of a D-GLDPC code. 



bits are associated with the VN, and interpreted by the VN as its k information bits. Then, the 
codeword length of a D-GLDPC code with N v VNs is N = Yli=i ki {ki being the dimension of 
the i-th VN). An (n, k) generalized CN is characterized by n connections towards the variable 
node decoder (VND), and is associated with n — k independent parity check equations. Then, the 
number of parity-check equations for a D-GLDPC code with N c CNs is M = Yl!i=i{ n i ~ ki) {ki 
and rii being the dimension and length of the i-th check node, respectively). For a description 
of a D-GLDPC code iterative decoder over the AWGN channel and BEC we refer to [3] and 
[4], respectively. 

For LDPC code ensembles, an important role is played by a theorem known as the stability 
condition [5]-[7]. The most important consequence of the stability condition is the possibility 
to upper bound the asymptotic iterative decoding threshold. If the communication channel is a 
BEC with erasure probability q, the stability condition leads to the following upper bound on 
the asymptotic threshold q* for the LDPC ensemble: 

g* < [AW(l)]- 1 - (D 
The inequality (OQ) is referred to as stability bound in this papejj]. In CO, A'(0) = A 2 is the fraction 

'We use the nomenclature stability bound as, even though Q} is sometimes referred to as the stability condition, strictly 
speaking it is a consequence of it. For more details we refer to [7, Theorem 3.66] and the related discussion. 
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of edges connected to the length-2 repetition VNs, while p'(l) is the derivative (computed at 
x = 1) of the LDPC CNs degree distribution p(x) = Ylj>2 Pi 2 - 5-1 ' where pj is the fraction 
of edges connected to SPC CNs of length j. The bound (OQ) was first developed from density 
evolution. Next we propose a simple graphical interpretation of (OQ) using extrinsic information 
transfer (EXIT) charts [8]. Let us denote by I a the average a priori mutual information in input 
to the VND or to the CND. Furthermore, let us denote by Ie,v(Ia, q) and Ie,c(Ia) the average 
extrinsic information for the VND and CND respectively (these functions are usually referred to 
as EXIT functions). Then, (OQ) is equivalent to the following condition: for q = q*, the derivative 
of the VND EXIT function Ie,v(Ia,q), with respect to I a and evaluated at I a = 1, must be 
smaller than the derivative of the inverse CND EXIT function I^} C {I A ) evaluated at I A = 1. 
That is (OQ) is equivalent to requiring 



dI E ,y(lA,q* 
dI A 



(2) 

i A =i 



i A =i ~ dI A 

There exist LDPC degree distributions achieving the bound (OQ) with equality, so that their 
threshold over the BEC assumes the simple closed form q* = [A'(0)p'(l)] _1 . For such LDPC 
distributions, the first occurrence of a tangency point between the VND EXIT function Ie,v(Ia, q) 
and the inverse CND EXIT function I^} c {Ia) appears at Ia = 1, i.e. 



I E y(l,<f)=l£cQ- 



9Ie,v(Ia,<1*) 



Ia=i dlA 



(3) 

I A =1 



For LDPC code ensembles characterized by VNs and CNs with degree at least 2, the first 
equality is always satisfied as both terms are equal to 1 . This occurs also for D-GLDPC codes 
with all variable and check component codes having a minimum distance rf min > 2 [9], which 
is an assumption of this paper. Then, only the second equality is considered in the sequel, and 
is referred to as the derivative matching condition. 

In this paper, the stability upper bound ([I]) and the derivative matching condition are extended 
to D-GLDPC codes (and to GLDPC codes as a sub-case). Our derivations lead to the conclusion 
that only the check and variable component codes with minimum distance d min = 2, including 
length-2 repetition codes and SPC codes, contribute to the stability bound. We also show that 
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for D-GLDPC codes satisfying the derivative matching condition the asymptotic threshold over 
the BEC can be expressed by a simple formula. 

The paper is organized as follows. Some definitions and the notation used in the paper are 
introduced in Section UH In Section [Till the possibility to reduce the rank of a linear block 
code generator matrix by column elimination is discussed. Using these results, in Section [IV] 
and Section [V] the stability bound is developed for GLDPC codes and for D-GLDPC codes, 
respectively. Final remarks are given in Section [VTJ 



II. Definitions and Basic Notation 

We assume as transmission channel a BEC with erasure probability q. For a bipartite graph 
with random connections, the extrinsic channel (that is the channel over which the messages are 
exchanged between the VND and the CND during the iterative decoding process) is modelled 
as a second BEC with erasure probability p depending on the decoding iteration [10], where it 
is readily proved that I a — 1 — P- Since we express both the VND and the CND EXIT functions 
as functions of p (and q for the VND), their derivatives are evaluated at p = (corresponding 
to I a = 1). In this case © becomes 



dI E ,v(p,q*) > dI E ,c(p) 1 



dp 



p=o dp 



(4) 

p=0 



Under the hypothesis of a random bipartite graph, the VND and CND EXIT functions can be 
expressed as 



Ie,v (p,q) = ^\ i { e,v (p» q) ( 5 ) 

i=l 



and 



^c(p)=X>igc(p), ( 6 ) 
i=i 

respectively, where Xy and Xq are the number of different VN and CN types, I^) v (p, q) and 
I^Ecip) are me EXIT functions for the i-th VN type and for the z-th CN type, respectively, and 
Aj and pi are the fractions of edges towards the VNs of type i and the CNs of type %, respectively. 
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For the sake of clarity, it is useful to isolate the contribution of the repetition component codes 
in © and the contribution of the SPC component codes in ©, so that 

(rep) (gen) 

Ib,v(p, ?) = E A ? CP) • " 9f + E A * 7 tVb> 9) 

i>2 i 
(rep) (gen) 

= J2 A f P) - 9 A -p(p) + E A * J f v (p, ?) (7) 

j>2 i 
(SPC) (gen) 

Ib,c(p) = E -°f C) • (! -P)^ 1 + E « 

i>2 i 
(gen) 

= Pspc(l-p) + Eft J Sc(p)- (8) 

i 

In ©, j is the length of the generic repetition VN, X^' is the fraction of edges connected to 
the repetition VNs of length j, and A rcp (x) = Ylj>2 We use in © the well known 

EXIT function expression over the BEC for a (j, 1) repetition VN, i.e. Ie(p,q) = 1 _ qpi~ l . 
The second summation in ([7]) is over all the generalized VN types. Analogously, in ([8]) j is the 
length of the generic SPC CN, Pj SPC ' ) is the fraction of edges towards the SPC CNs of length 
j, Pspc(^) — Z)j>2 pf PC ^ ^■ J ' -1 > an d we use the well-known EXIT function expression over the 
BEC for a (j , j - 1) SPC CN, i.e. I E (p) = (1 - P y~\ 

The EXIT function of an (n, k) generalized VN over the BEC, when maximum a posteriori 
(MAP) erasure correction is performed at the VN, can be expressed as 

1 n— 1 k 

I E (p, g) = l--£E a ^ P* - P) n_t_1 ^ ( X " ti k ~ Z ' < 9 > 



n 

f=0 2 = 



which can be readily obtained from [10, eq. 36] with 

<h,z = [{n ~ t) e n -t,k-z -(t + l) e n _ t _i ifc _J. 

The parameter e g ^ (with g = 0, . . . , n and h = 0, . . . , k) is known as the (g, h)-th un-normalized 
split information function, defined as explained next. Considering a representation G of the 
generator matrix for the (n, k) VN, and appending to it the (k x k) identity matrix e 9t h 
is equal to the summation of the ranks over all the possible submatrices obtained selecting g 
columns out of G and h columns out of Ife. We remark that the split information functions for 
a generalized VN, and therefore its MAP EXIT function ©, depend on the chosen generator 
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matrix representation [9]. Then, the performance of the overall D-GLDPC code depends on the 
code representation used for the variable component codes. For the same reason, two generalized 
VNs associated with the same code, but with different generator matrices (i.e. different mappings 
between information words and codewords) must be regarded in © as VNs of different types. 

The EXIT function of a generalized (n, k) CN over the BEC, when MAP decoding is 
performed at the CN, can be obtained by letting q — > 1 in ©. The obtained expression, equivalent 
to [10, eq. 40], is 

.. n—l 

I E ( p ) = l--J2a t p t (l- P r- t -\ (10) 
n L — ' 

t=o 

with 

a t = (n- t) e n - t ~(t+ l)e n -t-i 

For g = 0, . . . , n, e g is known as the g-th un-normalized information function of the (n, k) 
code, a concept first introduced in [11]. It is defined as the summation of the ranks over all the 
possible submatrices obtained by selecting g columns out of the generator matrix G. As opposed 
to the split information functions e g ^, the information functions e g are independent of the code 
representation. Thus, different check component code representations are associated with the 
same EXIT function for the generalized CN. The performance of a GLDPC or D-GLDPC code 
is then independent of the specific representation of its generalized check component codes. 

Let us suppose that a generic VN is a (n, k) linear block code C, with generator matrix G. 
We denote by C the (n + k,k) linear block code generated by [G|Ife]. The generic codeword of 
C is denoted by c, while the generic codeword of C by c'. We have c' = [c|u], where c and u 
must satisfy c = uG: the code C then depends on the chosen generator matrix representation 
for C. It is readily shown that d' min > d min + 1, where d mhl and d' min are the minimum distances 
of C and C, respectively. 

III. Reducing a Generator Matrix Rank by Column Elimination 

For a given (n, k) linear block code C and for a given representation G of its generator matrix, 
we denote by S t a generic submatrix obtained by selecting t columns out of G, and by S t the 
submatrix composed of the n — t remaining columns. 
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Definition 1: We say that S t covers a non-null codeword c G C when there are no '1' positions 
of c corresponding to columns belonging to St. 

Example 1: Let us consider a (7,3) simplex code with generator matrix 



and let us denote by S2 the submatrix composed of the last two columns of G. Then, the only 
non-null codeword covered by S 2 is [0, 1, 1, 1, 1, 0, 0]. 

The following theorem states that in order to reduce the rank of a given generator matrix by 
column elimination, it is necessary and sufficient that the removed pattern of columns covers at 
least one non-null codeword. 

Theorem 1: Let us consider an (n,k) linear block code C. For any generator matrix repre- 
sentation, we have rank(5 t ) < k if and only if St covers at least one codeword. 

Proof: [Sufficiency] Suppose that S t covers a codeword c, and consider a representation G 
of the generator matrix where c is one of the rows. It follows that removing from G the n — t 
columns associated with S t reduces the rank because at least one of the rows becomes an all-zero 
row, so that rank(iS 4 ) < k. Since any representation of the generator matrix can be obtained from 
any other representation by row summations only, and since row summations cannot modify the 
rank of submatrices composed of generator matrix columns, we have rank(S t ) < k also for any 
representation other than G. 

[Necessity] Conversely, let us suppose that rank(iS() < k for a given generator matrix repre- 
sentation. Using the same argument as for the sufficiency, we observe that this inequality must 
be satisfied also for any other representation of the generator matrix. As removing St from any 
generator matrix leads to a (k x t) matrix with reduced rank, it must be possible to obtain (from 
any generator matrix representation) a generator matrix where one or more rows have only '0' 
in those positions corresponding to St. All these rows are non-null codewords of C covered by 
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□ 



Corollary 1: We have rank(«S t ) = k for all S t if and only if n — t < d, 
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Proof: [Sufficiency] Let us suppose that rank(«S t ) = k for all S t . By applying Theorem [T] 
it follows that no submatrix S t (composed of n — t columns) can cover any codeword. Then 



[Necessity] Conversely, let us suppose that n — t< <i m i n . Then, no submatrix S t (composed of 
n — t columns) can cover any codeword. By applying Theorem Q] we conclude that rank(<S t ) = k 



Example 2: All the codewords of the (7, 3) simplex code of Example [Q have Hamming weight 
4. As one of these codewords is [1, 1, 1, 0, 0, 0, 1], Theorem[T|guarantees that if we remove the first 
three and the last column from G given in Example Q] (or from any matrix obtained performing 
row summations on G) we obtain a (3 x 3) matrix with rank smaller than 3. On the other hand, 
by Corollary Q] we know that, even if we remove any set of three or less columns, the rank of 
G remains unchanged. 

In [9] the concept of independent set was introduced. Given a (k x n) rank-r binary matrix, 
an independent set of size s is defined as any set of s columns such that removing these columns 
from the matrix leads to a (k x (n — s)) matrix with a rank smaller than r. By Theorem Q] we 
now state that a necessary and sufficient condition for a set of s columns to be an independent 
set of a (k x n) generator matrix is that the s columns cover at least one codeword. Moreover, 
by Corollary Q] we recognize that any set of s < d min columns cannot form an independent set 
for the generator matrix. 

IV. Stability Bound and Derivative Matching for GLDPC Codes 

In GLDPC codes all the variable component codes are repetition codes, which in © leads to 
^(rep) A (re P ) = x The EXIT function over the BEC for the VND is then given by I E ,v(p, q) = 
1 — q\ rep (x). It follows 




for all S t . 



□ 



dI E ,y(p,q) 
dp f 

From d8]), the derivative of Ie,c(p) at p = is 




(ID 



d! E ,c(p) 
dp 



(gen) 



p=0 




+ 



^2 Pi 



dp 



p=0 



(12) 
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In order to develop (1121) it is necessary to explicit the derivative of each generalized CN type 
EXIT function. This task can be performed by exploiting Corollary [T] as explained next. 

Consider an (n, k) generalized CN with EXIT function Ie(p) in the form (flOl) . It is readily 
shown that 



dI E (p) 



dp 



(n — l)ao — a-i 



P =o n 

We have do = if and only if the generalized CN has minimum distance rf min > 2. In fact, 
the generator matrix of the check component code is full rank (rank = k) by definition, so 
e n = k. Furthermore, from Corollary [H removing any single column from the generator matrix 
does not reduce the rank if and only if d m m > 2, in which case we obtain e n -k = nk, so that 
a = n e n — e n _i = nk — nk = 0. As recalled in Section U the hypothesis d min > 2 is always 
assumed in this paper. Then, we can assume ao = 0. 
If cLin > 2 for the CN we obtain 



Cti 

P =o n 



dp 

where a x — (n — l)e n _i — 2e n _ 2 = kn (n — 1) — 2e n _ 2 - By applying again Corollary [Q we 
obtain 

/ =0 if d min >3 
cti < (13) 
[ > if d min = 2 . 

If the CN exhibits a minimum distance d min > 3, then removing any pair of columns from the 
generator matrix does not affect the rank. In this case 2e n _ 2 = 2 k (™) = kn(n — 1), hence 
ai = 0. 

According to these results, the only generalized CNs that contribute to the summation in the 
second term of (fl"2l) are those characterized by d min = 2. By recalling that all the SPC codes 
have minimum distance 2, we conclude that (fl"2)) only depends on the check component codes 
with d min = 2. The derivative at p = of the CND EXIT function can be then expressed as 



[2] 

&Ie,c{p) , m kiUiiui - 1) - 2 e„._ 2 

"PspcI-U / j Pi 



dp 



p=o '— ' rii 



[2] „ A (i) 

-Pspc(1)-E^^ W 
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where the notation J^ 2 ' is adopted to indicate the summation over those generalized CN types 
with minimum distance 2. In (fl4l) . we have denoted by A^_ 2 the expression fcjn^nj — 1)/2— e ni _ 2 , 
that does not depend on the chosen representation for the z-th generalized CN type. 

The next theorem states that A„_ 2 is equal to the multiplicity A$ of the weight-2 codewords 
for the CNs of type i. 

Theorem 2: For any linear block check component code with minimum distance d min = 2, 
the parameter A n _ 2 equals the multiplicity A 2 of the CN codewords with Hamming weight 2, 
i.e. 

A n _ 2 = A 2 . 

Proof: Let <S n _ 2 be the generic (kx (n — 2)) matrix obtained by removing 2 columns from 
(any representation of) the CN generator matrix. By Corollary Q] we have that either rank( l 5 ri _ 2 ) = 
k or rank(iS n _ 2 ) = k — 1: considering a CN with <i min = 2, removing any single column cannot 
reduce the rank so that removing two columns can reduce the rank at most by one. 

We have 

kn(n — 1) 
A n -2 = e n -2 

= ^2 k ~~ z\j rank ( 5 «-2) 

S n -2 S n -2 

= ^ ( k - rank ( 5 n-2)) , 

Sn-2 

where we know that each term in the summation is either equal to or to 1 . By Theorem Q] any 
such term is equal to 1 if and only if 5 n _ 2 covers a (necessarily weight-2) codeword. □ 

The derivative at p = of the inverse CND EXIT function I^cip) 

i s given by 1 / dl e{p) I 'dp | p =o . 
Combining CLU, (© and Theorem El for GLDPC codes, © becomes 



q < 



Ar P) (p'sPc(l) + E^ 



Hi 



(15) 
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We can further simplify (1151) by noting that 



as for a SPC CN rij = j and A$ 
compact form 



U) 



(SPC) 

E Pi u - !) 

j 

{S ™< 2 A? 



(o) = J (i ~~ Hence, (fl"5l) can be written in the more 



q 



< 



A 



[2] 9 
(rep) ^Pi 



"I -1 



4^ n,- 



Ai rep) C 



(16) 



where 



C 



[2] 

^2piCi 



with 



a 



2A 



(0 



/I; 



and where now J^ 2 ' indicates the summation over all the d min = 2 check component codes, 
both SPC and generalized. 

For GLDPC codes satisfying the derivative matching condition © (the first occurrence of 
a tangency point between Ie,v(p,q) and I^cip) appears at p = 0), the threshold assumes the 
simple closed form q* = [A^ 6 ^ C] _1 . If only generalized CNs with <i min > 3 are used, then (fl"5l) 
becomes q* < [X^ ^ Pspc(I)] -1 - ^ me derivative matching condition is fulfilled in this case, we 
obtain <f = [A^VspcW]- 1 - 

V. Stability Bound and Derivative Matching for D-GLDPC Codes 

The derivative at p = of the CND EXIT function of D-GLDPC codes is the same as 
for GLDPC codes, that is dl^ 1 (p) / 'dp\ p=0 = -1/(7. The partial derivative of the VND EXIT 
function with respect to p and evaluated at p = 0, is developed next. 

It follows from © that 



dI E ,v(p,q) 



dp 



(gen) 

£a, 



dI { Ey(p,q) 



p=0 



(17) 



p=o dp 
In order to develop the summation over the generalized VN types in the second part of (fTTT) . we 
have to explicit the partial derivative respect to p of each generalized VN type EXIT function, 
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evaluated at p = 0. To this end, let us consider an (n, k) generalized VN whose EXIT function 
is given by ©. After defining 

n-l 
t=0 

we have 



diE{p,q) 



dp 



n V d P p=°J 



p=0 - 2=0 



Q) 1 



y (n - l)a 0t , - a M _ fc 



n 

2=0 



as it is readily shown that that d/(p)/dp | p =o = — (n — l)a ,2 + ai, z . The expression (fT8l) can be 
further developed by invoking Corollary [TJ Since any variable component code has minimum 
distance d mhl > 2 by hypothesis, removing any single column from the generator matrix G of 
the variable component code cannot reduce the rank of G. It follows 



ao,z — ne nt k-z — e n -i,k 

( , J — kn ( 

k — z I \k — z 



thus leading to 



9Ie(p, q) 



' * n 



2 = 



dp 

Corollary \T\ can be invoked again in order to show that 



= V z if d min > 3 
ai, z S (19) 
> V z if d min = 2, 

where d m i n is the variable component code minimum distance. In fact, under the hypothesis 
rfmin > 3, removing any single column or any pair of columns from (any representation of) G 
cannot reduce its rank. Under this hypothesis 



^1,2 — (n — 1) en-i.fc-js — 2 e n _2,fe-2 

k — zl \n — 2J \k — z 



kn(n-l) , 1 — 2 Ac 



0. 
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Hence, the only generalized variable component codes contributing to (fTTI) are those with 
minimum distance d m ™ = 2. This is coherent with the fact that, among the repetition component 
codes, only those with rf min = 2 (i.e. the length-2 repetition codes) give a non-null contribution 
to 

Then, (fTTT) can be developed as 

dI E ,v(p,q) 



dp 



q a? cp) - E a, jr h<n " 1 2g *- 2 *-' ^ (i _ 



p=0 n n ' 

2 2=0 

[2] fei 2 A (i) 

In the previous expression the symbol indicates the summation over those generalized VN 
types with minimum distance 2. Moreover, A^_ 2k _ z is defined as fc ' n '^ i ~ 1 - > ( fc fc i J — e^-j^.,. 
As opposed to A^_ 2 in (fl4l) . A^_ 2 k _ z in (|20l) depends on the component code representation. 
Using (l20l) we can express © for a D-GLDPC code as 



[2] k, 2 * (i) 

a^ p) + E A > E — t 1 - ^ ??• (21) 

2=0 



In the reminder of this section, we prove that (121)) can be written as an explicit upper bound to 
the decoding threshold q*. We start by proving the following theorem. 

Theorem 3: Let us consider an (n, k) linear block variable component code with minimum 
distance d min = 2. We have 

k k 

E 2An ~ 2 ' k ~ Z (qT (1 - q*) k ~ Z = E — (<?T> (22) 

2=0 «=1 

where v4 2)U is the number of the VN weight-2 codewords generated by weight-u information 
words. 

Proof: Let C be the (n, fc) variable component code and let G be the chosen generator 
matrix for C. Moreover, let S n ^ 2 ,k-z be the generic (k x (n — 2 + k — z)) matrix obtained by 
selecting n — 2 columns in G and k — z columns in the (k x k) identity matrix. 

Let us apply Theorem \T\ to the code C introduced at the end of Section HO Each codeword 
c' G C is composed of the concatenation of a codeword c E C with one of the possible 
2 k sequences of k bits (where by the linearity of C the all-zero length-A; sequence is always 
concatenated with the all-zero codeword of C). Combining this observation with Theorem Q] 
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and introducing the notation S n _ 2 ^- Z = [^^Ul^ifc-J we observe that a necessary (though not 

Q 

sufficient) condition for having rank(iS n _ 2 ,fc-z) < k is that S n _ 2 covers a weight-2 codeword of 
C. 

Next, we develop A n ^ 2 ,k~z as 

kn{n — 1) / k \ 

A n -2,k-z — ^ ( L. J ~ e n-2,k-z 

= k ~ £ rank(5 n _2,fc_ z ) 

>Sn-2,fe — z <S n _2,fc-z 

= (A; - iank(S n - 2 ,k-z)) 

S n -2,k — z 

= EE( A; - rank ([' s « G 2i^-,])) 




= ^^(fc-rank([e 2 |5L])), 

where ^ s use( ^ t0 indicate the summation over those S^_ 2 such that S n _ 2 covers a weight-2 
codeword of C. Then, we can write 



2An ~ 2 ' k ~ z (q*y (1 - q*) k ~ z 

2=0 

? * I 2 ! 

= -EEE( fc - rank (p° a |5j_j)) o?t (i - ^) fe " 2 

2 = C z 

9 I 2 ! fc 

= -EEE( fc - rank fl 5 ° 2 i 5 *-J)) (?t (! - ?) k ~ z - ( 23 ) 

By hypothesis there are no VNs with minimum distance 1. Then, for a given weight-2 
codeword c E C, any submatrix S n ^ 2 ,k~z is such that S n ^ 2 ^ z can cover at most one codeword 
of C, i.e. the codeword [c|u c ] subject to c = u c G. If we denote by w^(u c ) the Hamming 
weight of u c , for each weight-2 codeword c G C the summation over z in (1231) can always start 
from wh(xi c ). In fact, for z — 0, . . . , w#(u c ) — 1 it is not possible for S n - 2 ,k-z t0 cover the 
codeword [c|u c ], so that k — rank ([S^_ 2 \Sl_ z ]) = 0. That allows writing the second member 
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in ([23]) as 

[2] 



l E E £ (* " rank (P° al 5 *-.])) (<?T (1 - q*) k ~ Z - (24) 



n 

c z=w H (u c ) Si 



For given z > wh{u c ), the codeword [c|u c ] is covered by exactly (^_^(„ c j) matrices 
S n -2,k-z- Hence, there are exactly c y non-null terms in 

(k - rank(5„ G 2 |Sj_,)) (1 - g*)*"' • 

Deleting from G two columns corresponding to a weight-2 codeword of C reduces the rank 
of this matrix by one, leading to a rank k — 1. In fact, considering the VN minimum distance 
dmin = 2, removing the first column cannot reduce the rank (Corollary [D and removing the 
second column reduces the rank (Theorem Q]) necessarily by one. We can then conclude that 
each of the Ql^^)) non-null terms in the summation J2s* ~ ran ^ {[^n-^l^l-zl)) * s e q ua l 
to one, independently of z. Then we can further develop (|24l) as 



"E t ( k ~ W " { r\)wY(l- q 'f- (25) 
n t-? ^-f . \z-w H (u c )y 

We next observe that those weight-2 codewords c E C associated with the same u>#(u c ) (i.e. 
generated by information words having the same weight) produce the same contribution in d25l) . 
since only the Hamming weight of the information words u c matters. This observation allows 
us to write (1251) as 

E^E (*""W(1 -O fc -*, (26) 
z — ' n ^— ' \z — u I 

u=l z=u x ' 

where v4 2jU is the number of weight-2 codewords c E C such that w#(u c ) = u. In general, A 2yU 
depends on the variable component code representation. By noting that 

E C-n) {qr (1 " ^ = ' 
we finally obtain (1221) . □ 

Theorem [3] allows us to write the first member of (1211) as 

I 2 ] h cy At) 

g*Ar p) + E^E^(^T- 



rii 

u=l 
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The length-2 repetition VNs can be embedded into the summation over the generalized VN types 
with minimum distance 2. In fact, the only weight-2 codeword of a length-2 repetition VN is 
c = [1, 1], which is generated by a weight-1 information word. Then, for a length-2 repetition 
VN we have 

A («P)^^i(gy = A &»P) q * 

u=l 

Hence, (|2TI) can be put into the more compact form 



rii C 

l u=l 

where now the summation J^' 2 ' is over all the VN types with minimum distance 2, both repetition 
and generalized. 

The first part of (1271) is a real polynomial P(-) in the variable q*. This polynomial can be 
written as P(x) = Y^f^ KPi(x), where Pj(-) is a degree-^ real polynomial associated with the 
^min = 2 type-? VNs. Each Pj(-) is a monotonically increasing function (since all its coefficients 
are positive). Consequently, P(-) is a monotonically increasing function and its inverse 
exists. We have then proved the following theorem, which is the main contribution of this paper. 

Theorem 4 (Stability bound over the BEC for D-GLDPC codes): The asymptotic threshold q* 
of a D-GLDPC code ensemble over the BEC, assuming MAP erasure correction at each com- 
ponent code, fulfills 



q* < P- 1 (J~) , (28) 



where 



and 



[2] fc, A (i) 

ZA ^ x u 



p{x) = J2 x i p i( x ) with p i( x ) = J2 



Hi 



C = J2 Ci With C * 



2 



71; 



□ 



For an LDPC code ensemble (1281) returns q* < [A'(0) p'(l)] x , i.e. the well-known stability 
bound for LDPC codes. 
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Property 1: For a length-2 repetition VN (i.e. for a conventional LDPC degree-2 VN) we 
have Pi(x) = x. Hence, if the only d min = 2 VNs are length-2 repetition codes, we have 
P-\x) = (l/A|> rcp) ) x. This is the case for GLDPC codes. 

Property 2: For a length-?^ SPC CN (i.e. for a conventional LDPC degree-?^ CN) we have 
Q = rii - 1. 

Property 3: Any length-rij and weight-2 binary sequence is a codeword for a length-?-^ SPC 
CN. Then, Cj for a length-rij CN with minimum distance 2 is maximum when the CN is a SPC 
code. In other words, Cj fulfills 

G% < rii - 1, 

where the equality holds when the CN is a SPC code. 

Property 4: For any VN with minimum distance d min = 2, the value of Pi(x) depends on the 
chosen generator matrix through the coefficients A^ u . This is true for all x, except at x = and 
at x = 1 where we have 

Pi(0) = 

and 

m) = = Ci 

respectively. Independently of the VN representation, the value assumed by Pi(x) at x — 1 is 
equal to the value of C, for the same <i m i n = 2 linear block code when used as a CN. 

Property 5: For a D-GLDPC code ensemble satisfying the derivative matching condition, the 
iterative decoding threshold over the BEC assumes the simple form q* = P~ l (\/C). 

It should be noted that in general P _1 (l/C) is not a closed form for the threshold. However, 
there are simple cases in which can be explicited. An example is provided in the appendix. 

VI. Conclusion 

In this paper, a stability bound over the BEC has been developed for D-GLDPC codes. It 
generalizes the inequality q* < [X' (0) p' (l)]^ 1 , valid for LDPC code ensembles. We have shown 
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that for D-GLDPC codes, as for LDPC codes, the only variable and check component codes 
contributing to the bound are those having minimum distance 2. A derivative matching condition 
sufficient to achieve the bound with equality has also been defined. If the derivative matching 
condition is fulfilled, then the decoding threshold over the BEC for D-GLDPC codes is expressed 
by a simple formula, although in general not in closed-form. For GLDPC codes this formula 
always leads to a closed-form threshold expression. 

Appendix I 

D-GLDPC Codes with SPC Variable Nodes 

GLDPC codes employing strong generalized CNs (such as Hamming or BCH CNs) represent a 
possible solution for obtaining a good compromise between waterfall performance and error floor. 
Examples of such GLDPC code constructions are described in [12]— [16]. In general, increasing 
the fraction of strong generalized CNs can be very favorable from the point of view of the 
overall code minimum distance and then of the error floor, but presents drawbacks. 

A first drawback is represented by an overall code rate loss which makes GLDPC codes 
with large fractions of strong generalized CNs of interest only for low or very low rate [17]. 
The reason is briefly reviewed next. Let us consider a more general code structure, namely a 
D-GLDPC code. If we denote by r V:i and by tqj the code rate of the type-2 VNs and of the 
type-j CNs, respectively, the overall design rate is 

fl=1 _ S,ft(l-r«,) 

which is monotonically increasing respect to any ry,i and to any rc,j- A generalized CN of length 
n has a code rate smaller than the code rate of a length-n SPC CN. Then, a large fraction of 
strong generalized CNs determines an overall rate loss. In GLDPC codes this rate loss is difficult 
to compensate even using large fractions of length-2 repetition VNs (which are the highest rate 
VNs available if all the node in the Tanner graph have minimum distance at least 2) so that 
usually the overall GLDPC code remains of low rate. A second drawback is that GLDPC codes 
with large fractions of strong generalized CNs and large fractions of length-2 repetition VN are 
typically characterized by a poor asymptotic threshold due to the large area gap between the 
EXIT curves in the EXIT function (see the Area Theorem in [18]). 

Allowing the generalization of the VND together with the generalization of the CND provides 
an increased flexibility in the code design, that can be exploited to overcome the above mentioned 
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limitations. In particular, the rate loss due to the generalized CNs can be compensated using 
generalized VNs with a code rate larger than 1/2. In this context, a special class of generalized 
VNs is represented by (n, n — 1) SPC VNs each one having n edges towards the CND and 
associated with n — 1 encoded bits. It is shown in [19] that these codes can be effectively 
exploited for the design of D-GLDPC codes with good waterfall and error floor performance. 
In this appendix, we develop the polynomial Pj(-) defined in Theorem 0] for such VNs when 
represented in both systematic and cyclic form. We also propose a numerical example illustrating 
the capabilities offered by D-GLDPC codes with SPC VNs. 



A. SPC Variable Nodes in Systematic Form 



Let us suppose that the VNs of type-z are length-rij SPC codes in systematic form, i.e., 
represented by the ((rii — 1) X rii) generator matrix 



1 





.. 


. 


1 





1 


.. 


. 


1 








1 . . 


. 


1 







1 1 



Each of these VNs has (^) weight-2 codewords. Specifically, there are n — 1 weight-2 codewords 
generated by weight-1 information words, (™ 4 2 ^ 1 ) = ( w '~ 1 ^ rt '~ 2 ) weight-2 codewords generated 
by weight-2 information words and no weight-2 codewords generated by information words of 
weight larger than 2. Then 



.4 



(0 

2m 



Tii — 1 if U — 1 

(rit- l)K-2)/2 if u = 2 



if u 



■ ■ , Ui 



so that 



PAx) = — • (rii - 1) x H 



2 K-l)K-2) 



x 



rii 



x 1 



rii 



x 



(30) 
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B. SPC Variable Nodes in Cyclic Form 

Let the VNs of type-i be (n^rij — 1) SPC codes in cyclic form, i.e. generated by 



1 


1 


. 


. 








1 


1 . 


. 











1 . 


. 






0... 11 



In this case we obtain an expression of Pi(x) different from (l30l) . In fact, it is readily shown that 
in a SPC code represented in cyclic form, an information word of weight u generates a weight-2 
codeword if and only if all its '1' positions are contiguous. Then, for all u = 1, . . . , rij — 1 we 
rij — u, from which 



have A® 



Pi(x) 



m—l 

E 

u=l 

m—i 



2(rii-u) „.« 



n, 



x 



2V1" — V«" 



u=l 



2x 



X 



u=l 

1 2x 1 — rii x Ui ~ x + (rij 

Hi 



l)x ni 



x — 1 

2x [x ni — rii (x — 1) — 1] 



[X 



(31) 



(x - 1) 

If rij = 2 or = 3, then (|30l) coincides with ( |3T| ) as expected. Specifically, from both ( |30l ) and 
(|3TI) we obtain Pi(x) = x and Pj(x) = | x 2 + | x for rii = 2 an d rz^ = 3, respectively. 



C. Comparison between Systematic and Cyclic Form 

Let us denote by P s (-) and by P c (-) the polynomial Pj(-) of a length-n SPC VN in systematic 
and cyclic form, respectively. We show next that if n > 3 



PJx) - PJx) < 



> if < x < 1 
= if x = 1 
< if x > 1 . 
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In fact, we have 



P S (X) 



Pc(x) = ~ 

n 

2x 2 



in 



^ (ra-l)(n-2) 2 
l)x+ - ^ x 2 



n-l 

n ^ 



n — u)x 



n 



(n-2)(n-3) 



n-l 

- u) x u ~ 2 

u=3 



It is readily shown that 



n-l. 



in — u) 



(n-2)(n-3) 



(32) 



. Then, P s (l) — -P c (l) = 0, a result which is 



consistent with Property HI For < x < 1 we must have J2u 



n-l. 



in — u) x 



u-2 



< 



(n-2)(ra-3) 



which 



(n-2)(n-3) 



leads to P s (x) — P c (x) > 0; analogously, for x > 1 we must have J2u =3 (n — u) x li 2 > 
which leads to PJx) — P r (x) < 0. 



D. D-GLDPC Codes with Length-2 Repetition VNs and SPC VNs in Systematic Form 

Let us consider (T28l) . Although in general it is not possible to express in an explicit 

closed form, this is possible in special cases. For instance, obtaining a closed form expression 
of -P _1 (-) is possible when the only d m - m = 2 variable component codes are length-2 repetition 
codes and length-n SPC codes in systematic form. Let A be the fraction of edges connected to 
the length-2 repetition VNs and ji the fraction of edges connected to the length-n SPC VNs (so 
A + ji is the total fraction of edges connected to <i mill = 2 VNs). We have 

t,< ^ x 2(n-l) fn-2 \ 

P(x) = A x + — x x + 1 . 

n V 2 J 

By solving for positive y the equation P(y) = x, we obtain 



-[nA + 2 (n-l) 0nA + 2 (n-l) /if + 4 (n-2) {n-l) n^ix 
(X) ~ 2 (n - 2) (n - 1) u + 2 (n - 2) (n - 1) ~jl ' ( } 

In Fig. [2l (1331) is plotted for different values of assuming A + [i = 0.3 and SPC VNs of 
length n = 7. Each curve is associated with a different value of fi, i.e., with a different proportion 
between length-2 repetition VNs and length-7 SPC VNs in the VND. Hence, the curve labelled 
as 0.0 corresponds to the presence of only length-2 repetition VNs, while the curve labelled as 
0.3 to the presence of only SPC VNs. Hence modifying u provides a wide variety of options. 
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Fig. 2. Plot of P for a D-GLDPC code where the only d min = 2 VNs are length-2 repetition and length-7 SPC VNs. 
The total fraction of edges connected to d m i n = 2 VNs is A + fi = 0.3, and each curve is associated with a specific value of /x. 



E. Distribution Optimization 

We consider the optimization problem of a GLDPC and of a D-GLDPC code ensemble for 
design rate R = 1/2. In both cases we constrain the optimization process by allowing the 
repetition VN degree to range only between 2 and 15 and the SPC CN degree only between 5 
and 15. Moreover, we use (31, 21) BCH CNs, imposing a minimum fraction of edges connected 
to the BCH CNs equal to 0.7. For the D-GLDPC code ensemble, we allow also length-15 
SPC CNs in cyclic form. The output of an optimization process over the BEC performed with 
differential evolution [20], [21] is reported in Table I (from an edge perspective). For each of 
the two optimized distributions the threshold and the stability bound (|28T) are shown. While for 
the GLDPC code ensemble it is necessary to use only length-2 repetition VNs to compensate 
the rate loss introduced by the large fraction of BCH CNs with an overall poor threshold, for 
the D-GLDPC code ensemble the use of SPC VNs allows obtaining a much larger threshold. 
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GLDPC AND D-GLDPC DISTRIBUTIONS WITH LARGE FRACTIONS OF BCH CHECK NODES 
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GLDPC 


D-GLDPC 


Variable Nodes 


SPCcyc 15 




0.521581 


rep 2 


1.000000 


0.132836 


rep 14 




0.145293 


rep 15 




0.200291 


Check Nodes 


BCH 


0.700000 


0.721799 


SPC 5 




0.278201 


SPC 12 


0.174190 




SPC 13 


0.125810 




q* 


0.291516 


0.478585 


P-\1/C) 


0.291902 


0.478585 



From an EXIT chart perspective the capability of the SPC VNs to reduce the area gap between 
the EXIT curves is illustrated by comparing in Fig. [3] and Fig. HI 
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